Term Project Proposal: Building a Data-Based Named Entity Recognizer for German within the D-SPIN Project∗

نویسنده

  • Maria Schmidt
چکیده

In this section, I would like to introduce the project. This is done in more detailed subcategories: section 1.1 deals with the idea underlying the project. Then section 1.2 argues about the goal of the project while section 1.3 is concerned with possible extension points. Furthermore, section 2 pinpoints to related research in this area and to the knowledge base as such. Finally, section 3 until section 5 highlight the scope of the project, its deliverables, and a time plan containing milestones.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Improvement of Chemical Named Entity Recognition through Sentence-based Random Under-sampling and Classifier Combination

Chemical Named Entity Recognition (NER) is the basic step for consequent information extraction tasks such as named entity resolution, drug-drug interaction discovery, extraction of the names of the molecules and their properties. Improvement in the performance of such systems may affects the quality of the subsequent tasks. Chemical text from which data for named entity recognition is extracte...

متن کامل

Training and Evaluating a German Named Entity Recognizer with Semantic Generalization

We present a freely available optimized Named Entity Recognizer (NER) for German. It alleviates the small size of available NER training corpora for German with distributional generalization features trained on large unlabelled corpora. We vary the size and source of the generalization corpus and find improvements of 6% F1 score (in-domain) and 9% (out-of-domain) over simple supervised training.

متن کامل

Within-Word vs. Across-Word Decoding for Online Speech Recognition

In this paper we describe methods for improving the RWTH German speech recognizer used within the VERBMOBIL project. In particular, we present acceleration methods for the search based on both within-word and across-word phoneme models. The recognizer in the VERBMOBIL project is used in an online environment. We will discuss some incremental methods to reduce the response time of an on-line spe...

متن کامل

Project Managers Competencies based on ICB and Project Management Processes based on PMBOK in Project Based Organization (Case study: Hydropower Plants Management)

Effective implementation of managerial systems needs software and hardware requirements. Project management competencies of the managers is one of the most important and inevitable requirements to ensure the success of the projects in any industry. Inorder to clarify the requirements, many international and professional instituts have presented well-known frameworks to help the managers to shap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011